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ABSTRACT 

We study the high column density regime of the HI column density distribution function and argue that there 
are two distinct features: a turnover at Nm ~ 10 21 cirT 2 which is present at both z — and z ~ 3, and a lack of 
systems above Nm ~ 10 22 cm -2 at z — 0. Using observations of the column density distribution, we argue that 
the HI-H2 transition does not cause the turnover at Nm ~ 10 21 cm 2 , but can plausibly explain the turnover 
at Nm > 10 22 cm 2 . We compute the HI column density distribution of individual galaxies in the THINGS 
sample and show that the turnover column density depends only weakly on metallicity. Furthermore, we show 
that the column density distribution of galaxies, corrected for inclination, is insensitive to the resolution of 
the HI map or to averaging in radial shells. Our results indicate that the similarity of HI column density 
distributions at z — 3 and z — is due to the similarity of the maximum HI surface densities of high-z and 
low-z disks, set presumably by universal processes that shape properties of the gaseous disks of galaxies. 
Using fully cosmological simulations, we explore other candidate physical mechanisms that could produce a 
turnover in the column density distribution. We show that while turbulence within GMCs cannot affect the DLA 
column density distribution, stellar feedback can affect it significantly if the feedback is sufficiently effective 
in removing gas from the central 2-3 kpc of high-redshift galaxies. Finally, we argue that it is meaningful to 
compare column densities averaged over ~ kpc scales with those estimated from quasar spectra which probe 
sub-pc scales due to the steep power spectrum of HI column density fluctuations observed in nearby galaxies. 



1. INTRODUCTION 

The statistical distribution of HI column densities is one 
of the most widely used statistics to describe the statisti- 
cal properties of atomic hydrogen absorption systems seen 
in quasar spectra over a wide range of redshifts. Recently, 
high-resolution HI maps of nearby galaxies have been used 
to study HI column density distributions of both individual 
galaxies and statistically fo r the entire galaxy population (e.g., 
IZwaan & Proch aska 2006). Current measurements of the col- 
umn density distribution span almost ten orders of magnitude 
in column density, Nm, ranging from the highly ionized gas in 
the intergalactic medium (IGM), Nm ~ 10 -1.6xl0 17 cm" 2 , 
to the predominantly neutral gas associated with the inter- 
stellar medium (ISM) of galaxies and their satellites, Nm 



10 cm 2 (e.g., [Prochaska etal. 2010). Since physically di- 
verse systems contribute to the distribution, a theoretical un- 
derstanding of the entire HI column density distribution re- 
quires understanding IGM gas as well as gas within galaxies 
and the circumgalactic medium. As such, the HI column den- 
sity distribution is a key test of the CDM structure formation 
paradigm. 

Initial measurements of the HI column density distribu- 
tion were consistent with a single po wer-law funct ion over 
the entire column density range (e.g., Tytleii ll987l) . How- 
ever, subsequent surveys have found considerably more struc- 
ture: e.g., flattening at N m ~ 10 16 - 10 18 cnr 2 dPefitjean et alJ 
1993), and steepening of the function at higher column den- 
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sities corresponding to the damped Lyman a (DLA; Nm > 
2 x 10 20 cnT 2 ) systems, an d in particular, pronounced steep- 
ening at /Vht > 10 21 cm" 2 (IStorrie-Lombardi & Wolfdl2000l: 
I Prochaska et alJ 12005b IWolfe et all 12005b iNoterdaeme et alJ 
120091) . Most recently. IProchaska et all (l2010h found that the 
HI column density distribution has six different regimes, each 
approximated by its own power law, which intersect at col- 
umn densities N m = [10 14 5 , 10 173 , 10 19 , 10 203 , 10 2L7S ] cnr 2 . 

This rich structure is presumably the result of physically 
distinct populations of absorbers. At the lowest column den- 
sities of the Lyman a forest, Nm ~ 10 12 - 10 17 cm" 2 , ab- 
sorbers arise in the highly ionized intergalactic gas and both 
the shape of the column density distribution and its evolu- 
tion with redsh ift are now fairly well understood (e.g., see 
Meiksin 2009, for a recent review). At column densities 
of 7Vhi ~ 10 17 - 10 20 cm" 2 the absorbers arise in Lyman 
Limit Systems (LLS). These systems are thought to arise in 
circumgalactic gas and clouds (e.e... iFumagalli et alJ 1201 lh 
and are capable o f self-shielding against ionizing radiation 
dAltav et alj|201 lb . which results in a pronounced flattening 
of the distribution at these column densities. At the highest 
column densities of Nm ^ 10 20 cm" 2 , DLA absorption lines 
are thought to arise in the inters tellar medium of high-redshift 
galaxies (e.g.. lWol fe et alJ l2QQ5i) . These absorption lines thus 
directly probe properties of cold gas in high-redshift disks 
which contain most of the cold gas in the universe. 

As we discuss in the next section, modeling the column 
density distribution and other properties of these DLA sys- 
tems is both challenging and interesting. It is challenging 
because the small-scale density and velocity structure of the 
high-redshift ISM must be reproduced correctly. This same 
reason makes it interesting since the column density distri- 
bution provides a unique way of testing the small-scale gas 
distribution in theoretical models. 

The column density distribution for DLAs has two impor- 
tant features which we will discuss in this paper. First, at 
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both low and hig h redshifts there is a pronounced turnover at 
N m « 1Q 21 cm- ? dZwaan etalJl2005l: lN oterdaemeetal.l l2009t 
iProchaska & Wolfe! 120091) . The location of this turnover is 
found by fitting the column density distribution with either 
a gamma distribution or a double power-law. At low red- 
shift, where observations extend to higher column density, 
there is an additional feature. We see that there is a transi- 
tion from HI-H2 at Nu ~ 10 22 cm" 2 wh ich leads to a lack 
of HI systems above this column density (Zwaa net alj|2005l 
iZwaan & Procha ska 2006). At high redshift this region has 
not been probed due to insufficient statistics. Previously, these 
two features were not distinguished but in this paper we will 
treat them separately since we are interested in differentiat- 
ing the physical mechanisms which control each of them. In 
previous works, authors tried to understand the steepening 
in the column density distribution at high column densities, 
Nm > 10 21 cm" 2 . Proposed explanations discussed so far in- 
clude selection effects due to dust obscuration, conversion of 
atomic hydrogen to molecular hydrogen, and inclination ef- 
fects due to randomly oriented galaxies. 

Dust obscuration was c onsidered by Fall & Peil (1 1993b and 
iVladilo & Perouxl ( 120051) . who argued that it could explain 
the steepening in the DLA regime. The importance of dust 
obscuration is still a s ubject of debate (Ellison et alJ 12001b 
Jorgenson et alj 120061: iFrank & Perouxl 1201 Ot iKaplan et al.l 
20101). In p a rticula r, iKaplan et al.l d2010l) and, most recently, 



Khare et ail (12012b found that a non-negligible fraction of 



high-metallicity DLAs are significantly reddened. Given that 
optical quasar samples are constructed using color-based can- 
didate selection, the samples may be biased against signif- 
icantly reddened QSOs. In the extreme case, when a QSO 
is completely obscured in the UV and optical range by dust, 
such a quasar would be missing in radio-selected samples 
as well because no optical spectrum or redshift would be 
measured for the radio source. However, we note that al- 
though empty-fields have been found in radio-selected sam- 
ples, these empty-fields were found to contain extended struc- 
ture in the IR indicating t hey were not high-redshift quasars 
(see lJoreenson et aTll2006l for more details). 

A physical explanation for the lack of high column den- 
sity systems above Nm ~ 10 22 cm" 2 discussed by a num- 
ber of authors is the atomic-to- molecular transition (LS chave 
200 U iHira shita & F erraral 120051; IZwaan & Prochaskal 12006b 
Krumholz et al.l2009atlCenl2012tlAitav et al.l201 lb . The idea 



is that at large column densities the gas is self-shielded and 
hence can form H2 if there is sufficient dust. The transition 
from HI to H2 is expected to occur at a characteristic column 
density and can thus introduce a feature such as a steepening 
of the HI column density distribution. 

Inclination effe c ts were first consid ered in iMil groml d 1 9 8 8 ) : 
iFall & Peil d!993l) : IWolfe et al.l d 1995b where the authors con- 
sidered the column density distribution of randomly oriented 
disks. They assumed that the disks had a radial column den- 
sity profile which was monotonically decreasing with some 
maximum column density at some minimum radius. They 
considered a universe populated with these disks at random 
orientations and found that the column density distribution of 
such a model has a kink at the maximum column density of 
the radial profile. We will further generalize this model in §[3] 

In this paper we emphasize the two features in the col- 
umn density distribution for DLAs. The lack of systems in 
the local universe above Nm ~ 10 22 cm" 2 can be e xplained 
by the HI-H 2 transition (IZwaan & Prochaskal [2006b . How- 
ever, we will argue that observations show that the turnover 



at Nm ~ 10 21 cm" 2 cannot be explained by the same mech- 
anism. This is because the HI-H2 transition depends on the 
metallicity and UV radiation of the environment. However, 
observations show that the turnover at Nm ~ 10 21 cm" 2 oc- 
curs both in local galaxies, which have metallicities and in- 
terstellar UV fields close to those of the Milky Way, and in 
high-redshift galaxies, which have low metallicities and high 
UV fluxes. The HI-H2 explanation would predict a transition 
at a much higher column density at high redshift and therefore 
a turnover which is not independent of redshift. In addition, 
we show that the HI column density distribution of individual 
z ~ galaxies all have a similar turnover which does not ex- 
hibit a significant dependence on metallicity. Furthermore, we 
can show that the turnover at Nm ~ 10 21 cm" 2 can naturally 
arise from randomly oriented galaxies. 

We present details of the argument against molecular hy- 
drogen formation being the cause of the turnover at Nm ~ 
10 21 cm" 2 in §12 In §[3] we consider the effect of randomly 
oriented galaxies on the column density distribution and find 
that they naturally give rise to a turnover in the column density 
distribution which depends only on the characteristic maxi- 
mum column density of a galaxy and not on the small-scale 
features. In §|4]we use the column density distribution func- 
tion of DLAs as a stringent test of the gas distribution in 
galaxy formation simulations and show that our simulations 
are discrepant with observational results. In § [5] we com- 
pare our results to other recent results from galaxy formation 
simulations by different groups and argue that although cur- 
rent models are quite successful in matching and explaining 
the column density distribution of HI absorbers over a wide 
range of column densities, simulation results are generally 
discrepant or in tension with observations beyond the turnover 
atNm £ 10 21 cm" 2 . 

2. TURNOVER IN THE HI COLUMN DENSITY DISTRIBUTION AND 
THE HI-H 2 TRANSITION 

2.1. HI Column Density Distribution 

The column density distribution of quasar absorption sys- 
tems is defined as the number of atomic hydrogen systems, 
N, along a random line of sight per unit column density, dNm, 
per unit absorption length, dX: 



f(Nm,z) = 



d 2 N 
dNmdX' 



where 



dX : 



Hp 
H(z) 



(1 +zfdz. 



(1) 



(2) 



In this study we focus on the highest column density regime of 
DLAs, which are associated with the ISM of galaxies hosted 
by dark matter halos of different mass, M. Using the comov- 
ing number density of halos per unit halo mass, dn(M, z) / dM, 
along with the differential HI cross-section associated with 
such a halo, d<r(M, Nm, z) I dNm, where <r(M,Nm,z) is the 
cross-section in proper units for producing absorbers with col- 
umn densities lower than Nm, we can re-express the column 
density distribution as 



f(Nm,z) 



Ho J 



do-(M,N m ,z) dn(M,z) 



dN m 



dM 



dM. 



(3) 



This convention emphasizes the physical factors which affect 
the column density distribution function. The halo number 
density depends on the physics of halo formation which is 
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fairly well understood. However, the differential cross-section 
in a galaxy with a given halo mass is determined by the sur- 
face density distribution on the length scale at which it is 
being measured. For quasar absorption studies at high red- 
shift, this scale is comparable to the transverse physical size 
of the quasar emitting region (i.e., <k pc). This distribution 
will be affected by ISM processes such as turbulence, gravi- 
tational instability, gas chemistry, star formation, stellar feed- 
back, etc. Thus, the column density distribution provides a 
unique high-resolution window into the structure of the ISM 
of high-redshift galaxies and the processes that shape it. 

At the same time, this presents an obvious challenge to the 
models since the highest resolution galaxy formation simula- 
tions have resolutions which are orders of magnitude larger 
than the scale probed by quasars. Thus, when simulation re- 
sults are compared to observations, it is implicitly assumed 
that disparity in scales and unresolved small-scale structure 
of the ISM does not affect the column density distribution. 
Although highly non-trivial, as we discuss below this assump- 
tion is supported by observations of the column density distri- 
bution at different scales in nearby galaxies and is due to the 
steep power spectrum of HI column density maps. 

2.2. Observed Column Density Distribution ofDLAs at high 
and low redshifts 

In Figure Q] we plot the column density distribution of sys- 
tems in the local universe and at z — 3. As was noted before 
(Zwa an etalJl2005t iProchaska et"ai1l2005t iNoterdaeme et all 
2009), the column density distribution of DLAs at z ~ 
and z ~ 3 are remarkably similar and exhibit a turnover at 
Nm ^ 10 21 cirT 2 . At low redshifts, where systems have been 
measured to higher column densities, this turnover continues 
to higher column densities and DLAs with Nm ^ 10 22 cm" 2 
become exceedingly rare. Now we will discuss the physical 
mechanisms that may be responsible for the steep decrease of 
the cross-section of high column density gas. 

Column densities above Nm ~ 10 cm" 2 are typical for 
giant molecular clouds (GMCs) and f(Nai,z) in this regime 
should probe gas associated with, or fueling formation of, 
GMCs and star formation in galaxies. For this reason, it is nat- 
ural to associate the lack of syste ms above Nm ~ 10 22 cm" 2 
with the HI-H2 transition. Indeed. IZwaan & Prochaskal (120061) 
showed that for local z ~ galaxies the column density distri- 
bution of HI smoothly joins onto the column density distribu- 
tion of molecular gas, with a cross-over at A^h ~ 10 22 cm" 2 , 
indicating that the turnover is due to the HI-H2 transition. Re- 
cently, the HI-H2 transition was invoked in theoretical stud- 
ies as a significant factor in steepening the column de nsity 
distribution at Nm ^ 10 22 cm" 2 even for high-z DLAs (ICenl 
120121; lAltav et al.ll201 lh . However, the H 2 models used in 
these studies were calibrated on local galaxies, most of which 
have solar or super-solar metallicities. The HI-H2 transition, 
on the other hand, is expected to depend on both the amount o f 
dust and the ambient far UV radiation (e.g. Elme greenl 19931) . 
The dust helps to shield molecular gas from photodissociation 
and acts as a catalyst for H2 formation. If the dust-to-gas ra- 
tio is proportional to the metallicity of the gas, as ind icated 
by observations at Z > 0.1Z o (e.g., Draine et al. 20Q3), the 
HI-H2 transition will corresp ondingly depen d on the metal- 
licity of the absorbing gas. In Schaye (2001), the author de- 
veloped a theoretical model which predicted the maximum 
HI column density as a function of metallicity. However, his 
model predicted that the maximum HI column density scales 



as « 1 / VZ, in disagreement with local observations of the 

H I-H2 transition . 

iGillmon et al. (2006) measured the transition in the Milky 
Way and T umlinson et al.1 (120021) measured the transition in 
the Small Magellanic Cloud (SMC) and Large Magellanic 
Cloud (LMC; see also a compilation of archiv al measure- 
ments in the SMC, LMC and Milky Way by IWeltv et all 
120121) . The transition in the Milky Way occurs at a column 
density of Nh ~ 10 20 4 - 10 20 7 cm" 2 , depending on whether 
the line of sight is at high latitude or along the disk. This dif- 
ference is expected due to a higher ^-dissociating UV field 
in the disk plane. The transition in the LMC (Z « O.3Z ) 
occurs at Nh > 10 21 3 cm" 2 and the transition in the SMC 
(Z m O.15Z ) occurs at Nh > 10 22 cm" 2 . Thus, the transition 
column density in these galaxies scales as « 1 /Z or even more 
strongly. 

In light of the expected metallicity dependence of the HI- 
H2 transition, the similarity of the low and high-z HI column 
density distribution turnover at A^hi ~ 10 21 cm" 2 in FigureQ]is 
remarkable and strongly disfavors the HI-H2 transition as the 
physical origin of the turnover. While t he low-redshift sys - 

tems have metallicities of [Z/H] 0.5 (IZwaan et al.l 2005). 

high-redshift DLA s have typical metallicities of \Z/ H] < -1 
(INoterdaeme et al.l 120091 IProchaska & Wolfdl20091) . and we 
would thus expect the turnover in the high-z f(Nm,z) to oc- 
cur at least 0.5 dex higher in column density. Yet, the col- 
umn density distributions of these systems exhibit turnovers 
at similar column densities. This indicates that the origin of 
the turnover is not due to the phase transition from atomic to 
molecular gas, but rather due to the overall density structure 
of interstellar gas in galaxies. Remarkable similarity of the 
low- and high-redshift distribution indicates that some univer- 
sal and metallicity independent process is likely responsible 
for this structure. 

This result is consistent with the distribution of DLAs in the 
Nm-Z plane which has been used to a rgue that there is a metal - 
li city dependent maximum N m (e.g. lKrumhoIz et alj |2009a). 
In lKrum holz et al. (2009a), the authors present a compilation 
of observations of high-redshift DLAs and show that the dis- 
tribution of DLAs in the Nm-Z plane is consistent with being 
bounded by a metallicity dependent maximum Nm which is 
due to the HI-H2 transition. Binning this data in Nm gives 
the column density distribution of the compiled sample which 
displays a turnover at Nm ~ 10 21 cm" 2 . This binning exer- 
cise can be used to show that the feature is primarily due to 
the density of points in the Nm-Z plane rather than the cutoff 
in the plane due to the HI-H2 transition, and hence that the 
turnover at Nm ~ 10 21 cm" 2 is not primarily due to the HI-H2 
transition. 

2.3. Column Density Distribution within the ISM of z ~ 
Galaxies in the THINGS Sample 

If the shape of the overall HI column density distribution 
is related to some universal features of the HI surface den- 
sity distribution in individual galaxies, it would be interest- 
ing to explore whether such universality is indeed observed 
in individual galaxie s. To this end, w e use galaxies from 
the THINGS sample (I Walter et al.ll2 008). in which HI surface 
density maps are measured for 34 local galaxies using 21- 
cm observations. The physical resolution of the maps ranges 
from ~ 100 to ~ 500 pc depending on the distance to the 
galaxy. We use the robust weight data cubes of the galaxies 
since they have the highest resolution. We have omitted three 
galaxies from consideration: M81 DwA, NGC 3031, and IC 
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Fig. 1. — Column density distribution for DLAs at two different redshifts. 
The circles, z = (Zw+05), are fro m observations of local galaxies based on 
21cm emission iZwaan et al. 2005). The triangles, z = 3 (Pr + 09), are from 
observations of quasar absorption lines in SDSS DR5 in the redshift range 
Z e [2.2, 5.5] (Prochaska & Wolfe 2009). Finally, the squares z = 3 (No+09), 
are from observatio ns of quasar absorption l ines in SDSS DR7 in the redshift 
range z £ [2.2, 5.2] I Noterdaeme et al. 2009). The decrease in the overall nor- 
malization is thought to be due to star formation fueled by DLAs and due to 
processes which remove HI gas from galaxies such as A GN activity, galactic- 
scale winds, tidal effects, and ram pressure stripping (see Prochaska & Wolfe 



2574. M8 1 DwA does not have metallicity information^], the 
NGC 303 1 map suffers from significant point source contam- 
ination, and IC 2574 was not available online. The SMC col- 
umn density distribution comes from a HI map of the SMC 
wit h a 200 pc resolution , kindly provided by Alberto Bolatto 
(see lBolatto et al.ll201 ll for details). 

Note that we do not smooth these maps to achieve a uniform 
spatial resolution independent of the distance to each galaxy. 
We do not believe this will significantly affect our results. 
IZwaan et alj (12005) showed that the HI column density dis- 



tribution of nearby galaxies at Nm ~ 10 - 10 cirT- did not 
change significantly when they varied the resolution of their 
maps from 1.5 kpc to 3 kpc. In addition, studies of nearby 
galaxies have shown that the HI power spectrum has a power- 
law behavior over a wide range of scales, ranging from ~ 0.1 
pc up to ~ 10 kp c (Stanimiro vic et al. [T999; Elme green et al.1 



l200lt iMiville-Deschenes et alj|2003t fDutta et al.ll2009l etc.). 
These power-laws are steep with exponents ranging from 
~ -1.5 to ~ -3.5. Such a steep power-law behavior indi- 
cates that the HI maps do not have large-amplitude fluctua- 
tions on small scales and hence the column density distribu- 
tion should not look very different as we change the scale. If 
the HI column density distribution of high-redshift galaxies is 
characterized by similarly steep power spectra, this would val- 
idate the assumption that the column density distribution com- 
puted from quasar lines of sight can be meaningfully com- 
pared to the column density distribution computed from 21- 
cm HI maps that probe column densities averaged on vastly 
larger scales. 

6 M8 1 DwA is a low-mass dwarf irregular galaxy and, as such, likely has a 

low metallicity of [Z/H] 1 . Nevertheless, HI column density within this 

galaxy does not exceed Nm ~ 5 X 10 20 cirT 2 (Walter et al. 2007). 



For each THINGS galaxy that we use, we compute the col- 
umn density distribution: 



Wm) = 



AJVhiAX ' 



(4) 



where AAAWhi) is the number of resolution elements with col- 
umn density in the range Nm to Nm + AA^hi and AX is the 
absorption length for each galaxy, which is arbitrary. In terms 
of the formalism from § 12. 11 f(Nm) is proportional to the dif- 
ferential cross-section of the galaxy, 



Wm) * 



c da-(M,N H1 ) 
H~o dN m ' 



(5) 



Since we are just comparing the differential cross-sections and 
shapes of the column density distributions in different galax- 
ies, we have adjusted the normalization by choosing AX such 
that all column density distributions coinc ide with the column 
density distribution of local galaxies from lZwaan et al.l (120051) 
atA^m = 10 20 cirT 2 . 

The resulting column density distributions for the THINGS 
galaxies are plotted in Figure |2l in which each colored line 
represents the distribution of an individual galaxy with color 
indica ting the galaxy's metallicity (taken from Walter et al. 
120081) , as shown by the color bar. Although the THINGS 
sample consists of a wide range of systems, from dwarf ir- 
regulars to spirals, with metallicities varying by more than 
1 .5 dex, the HI column density distributions of these galaxies 
are quite similar and the scatter is surprisingly small. There 
is no clear correlation of the shape of the distribution with 
metallicity; all galaxies have a turnover at column density 
N m ~ few x 10 2 " cnr 2 . 

This is shown more quantitatively in Figure [3] where we 
plot the turnover column density versus galaxy metallicity. 
We determine the turnover column density by performing 
a double power-law fit to the deprojected (face-on) column 
density distribution of each galaxy; we define the turnover 
column density as the column density where the two power 
laws intersect. To deproject the column density maps we 
just rescale the column densities in individual pixels by cos 9, 
where 6 is the inclination angle of a particular galaxy. The 
figure shows that there is at best only a weak correlation be- 
tween turnover column density. The correlation coefficient 
for the shown points is -0.40 + 0.14, where the error is com- 
puted from 10000 bootstraps. Although a weak correlation 
is present, we do not observe any strong correlation which 
would be expected if the turnover was caused by the HI-H2 
transition. The weak correlation that may be present could be 
due to the fact that the column density distribution depends 
weakly on the mass of the galaxy, which would also imply a 
weak correlation with metallicity due to the metallicity-stellar 
mass relation. In addition, we note that the two galaxies with 
the highest characteristic column densities in Figure [3] (SMC, 
NGC 1569) are galaxies showing signs of strong tidal inter- 
actions with their neighbors and have associated bursting star 
formation. Their higher than average column densities may 
thus be due to the tidally-induced inflow of gas into their cen- 
tral regions. 

As a check of this weak correlation, we performed two ad- 
ditional tests. First, we did the analysis in Figure[3]using more 
recent metallicities for th e 23 galaxies in THINGS which have 
updated metallicities in Moustaka s et al.l ((2010). We used 
the metallicities derived using the empirical calibration from 
dPilvugin & Thu an 2005) and found that the correlation func- 
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tion became -0.39 ± 0.14, indicating that the effect of using 
the updated metallicities is small. Second, we performed a 
KS test on the inclination corrected column density distribu- 
tions. We separated the galaxies into quartiles based on their 
metallicities and then performed KS tests between all pairs 
of galaxies with one drawn from the bottom quartile and the 
other drawn from the top quartile. The result was that 90% 
of such pairs had a KS probability greater than 0.01 of being 
drawn from the same distribution. Thus, in addition to the re- 
sult in Figure [3] we have a non-parametric test which shows 
that the column density distribution of the THINGS galaxies 
show little metallicity dependence. 

The lack of strong correlation between the turnover column 
density and the gas metallicity shown in Figure [2] and Fig- 
ure [3] is in contrast with the strong correlation that would be 
expected if the HI-H2 transition was shaping the turnover. In 
this respect, we can also note that although the column density 
distribution of molecular hydrog en extends the HI distribu - 
tion smoothly in z ~ galaxies (Zwaa n & Procha ska 2006), 
H2 only affects the shape of the total hydrogen distribution 
for N m > 10 21 7 cm" 2 and f(N m ) < 10~ 26 . This limit comes 
from computing how many HI and H2 lines of sight there are 
above a given column density, which is just the integral of the 
column density distribution. Above the column density of the 
turnover, Nm ~ 10 21 cm -2 , there are more than 10 times as 
many HI lines of sight as H2 lines of sight. 

With this argument in mind, we see that the HI-H2 transi- 
tion does not cause the turnover of the column density distri- 
bution at Nm ~ 10 21 cirT 2 in the local universe. Furthermore, 
although the HI-H2 transition can help explain the paucity 
of HI absorbers with Nm > 10 22 cm for relatively metal- 
rich galaxies at z ~ 0, such an explanation would not work 
for higher-redshift DLAs. These DLAs have considerably 
smaller metallicities and for low metallicities the transition 
is expected to occur at much higher column densities (see ob- 
servations of t he tr ansition in the Milky Way, SMC, and LMC 
discussed in § !2.2l and theoretical predictions shown in Fig. [7] 
below). This reasoning predicts a tail out to high Nm in the 
high-redshift column density distribution which is not present 
in the local universe. 

The similarity of the column density distributions of the 
galaxies in THINGS, despite a wide range of metallicities, 
sizes, star formation rates, tot al HI masses, and absolute lu- 
minosities (Wal ter et al.l l2008). implies that the shape of the 
column density distribution at high column densities may be 
due to some universal physical mechanism. In the next section 
we will explore the connection between the column density 
distribution of a single galaxy and the column density distri- 
bution measured for a statistical sample. 

3. COLUMN DENSITY DISTRIBUTION OF RANDOMLY ORIENTED 
GALAXIES 

In the previous section, we presented arguments that the HI- 
H2 transition cannot explain the turnover at Nm ~ 10 21 cm 2 . 
Here we investigate the origin of this turnover by examining 
how the column density distribution within individual galax- 
ies contributes to the overall statistical column density distri- 
bution probed by quasar lines of sight. The inclination angle 
of a galaxy can affect the column density distribution since it 
affects both the number of absorption lines through a galaxy 
and the column density of those lines due to projection. 

The effect of i nclina ti on angle was inv estiga ted in stud- 
ie^lMilgro^ (fl988h . iFall & Peil (fl993l and lWolfeetal.1 
( 1995), in which the authors considered column density dis- 
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Fig. 2. — Column density distribution for individual galaxies (different col- 
ored lines) in the THINGS sample and the SMC. Each column density distri- 
bution line is colored by the oxygen abundance of the galaxy relative to the 
Sun (12 + (Oq/Hq) = 8.6), where the color bar is linear in oxygen metal- 
licity. For comparison, the dashed line is the HI column density distribution 
of galaxies in the local universe measured by Zwaan et al. 12005). The SMC 
column density distribution is the one which extends to the highest column 
densities. It is constructed from a HI map of the SMC with a 2 00 pc resolu- 
tion, kindly provided by Alberto Bolatto (see Bolatto et al. 201 1, for details). 
The SMC has a metallicity of 12 + (O/H) = 7.96. The column density dis- 
tribution of each galaxy has been normalized such that it coincides with the 
distribution of Zwaan et al. 1 2005) at Nm = 10 20 cirT 2 . Note that the column 
densities in this figure have not been corrected for inclination effects because 
we are emph asizing the comparison with the column density distribution of 
IZwaan et all j2005) which considered a statistical sample of galaxies. We 
will discuss inclination effects further in §f5] 



tributions arising from randomly oriented thin disks with fixed 
radial surface density profiles. They assumed that the radial 
column density profiles are monotonically decreasing, with a 
maximum column density at some minimum radius. Under 
these assumptions, they showed that the column density dis- 
tribution has a kink at the maximum column density of the 
galaxy viewed face-on. Above this column density, all sight 
lines are due to projection effects and the column density dis- 
tribution behaves as a power-law with an exponent of -3, in- 
dependent of the specific radial profile. Below the critical col- 
umn density, the column density distribution depends on the 
details of the radial profile. 

First, we would like to note that this result can be extended 
to surface density profiles that are not axially symmetric. To 
prove this we consider a galaxy which, when viewed face- 
on, has a column density distribution given by f±(N m ), where 
A^ is the column density as measured for the face-on galaxy. 
Next, we rotate this galaxy so it has an inclination angle 8 
with respect to the observer and consider the column density 
distribution, denoted by fe{N m ). We make the assumption 
that the galaxy is thin in the z direction which allows us to 
relate the inclined column density, A^ p to the face-on column 
density, A^ p via A^, = N m / cosO. Furthermore, when we 
incline the galaxy, the number of lines of sight piercing the 
galaxy decreases by cos 6 due to projection effects. Therefore, 
the number of lines of sight between A^ and A^j + dN m is 
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Fig. 3. — Turnover column density in individual THINGS galaxies versus 
metallicity of the galaxy. The turnover column density is determined by fit- 
ting a double power-law to the deprojected (face-on) column density distri- 
bution of each galaxy. To account for inclination, the column densities have 
been scaled by cos 8, where 6 is the inclination angle of the galaxy. The 
correlation coefficient is -0.40 ± 0.14 where the error bars are from 10000 
bootstraps over the data. There is no strong correlation of turnover column 
density with metallicity, as would be expected if the HI-H2 transition was the 
main factor shaping the turnover of the column density distribution. Note that 
discreteness of the turnover column density is 0.05 dex which is the resolution 
we used to compute the column density distribution of each galaxy. 



given by 



thus we get 



fe(N e m )dN e m = cos Of ± (N^)dN^, 



f e {N e m ) = cos 2 Of i(A^) 



(6) 



(7) 



Finally, we average this distribution over all possible inclina- 
tion angles to get the column density distribution of randomly 
oriented galaxies: 



Wm) 



-a 
-a 



d6 sin 6 f e (N m ) 



d6 sin 6 cos 2 0f ± (N m / cos 6) 



Jo 



(8) 



While equation (HI) has appeared i n the literature (e.g. 
I Wolfe et al.ll 1995b I Wolfe & Chenll2006l) . previously it was de- 
rived for column density distributions arising from galaxies 
with monotonically decreasing radial profiles. Now we see 
that the result is more generic and applies to any two dimen- 
sional structure, not simply disks. We would like to note that 
equation ([8]) implies that the column density distribution has a 
tail which extends to arbitrarily large column densities. This 
is due to the relation between the inclined and face-on column 
density, = N^J cos 6, which implies that arbitrarily large 
column densities can be reached as we approach an edge-on 
inclination. Of course, the real maximum is given by the max- 
imum edge-on column density. Therefore, even the randomly 
inclined column density distribution naturally has a maximum 



column density. More precisely, this inclination model starts 
to break down for large inclination angles when the lines of 
sight pierce uncorrected regions in the galaxy. 

As we see from equation ©, if the face-on galaxy has a 
maximum column density, or if the face-on galaxy column 
density distribution becomes steeper than above some 
column density, then above this column density the column 
density distribution will behave as a power-law with an ex- 
ponent of -3. Note that the high column density tail seen in 
Figure[T]is consistent with this -3 exponent. 

With this idea in mind, we return to the plots we have above 
regarding the column density distribution of the THINGS 
galaxies. As we can see in Figure[2] individual galaxies have 
steeper distributions than the local column density distribu- 
tion. This effect is further emphasized in Figure [3] where 
we include inclination corrections and determine the turnover 
column density by fitting a double power-law to the face-on 
corrected column density distribution of each galaxy. Above 
the turnover column density, the column density distributions 
of individual galaxies have slopes steeper than -3. From 
equation ^ we see that inclination averaging wipes away the 
precise details of the column density distribution above this 
turnover column density because the integral in equation ([8j 
is relatively insensitive to steep column density distributions. 

Now we can use this intuition to understand which sys- 
tems are most important for the column density distribution. 
Since inclination averaging wipes away information beyond 
the turnover of an individual galaxy, the systems with col- 
umn densities above this turnover are not important for the 
global column density distribution, as measured for a statis- 
tically large sample. These systems are necessarily small in 
size (< few 100 pc) since they contribute relatively few lines 
of sight. Thus, we are arguing that the small-scale features 
in column density maps are not important for the global col- 
umn density distribution whose high end tail is determined by 
projection effects of the bulk of the gas in a galaxy. 

To give further evidence to support this assertion, we con- 
sider the galaxy NGC 2403 from the THINGS sample. In 
Figure|4]we display the face-on corrected column density dis- 
tribution of NGC 2403 computed using maps with pixels of 
different size (indicated in the legend). In addition, we com- 
pute the column density distribution as derived from the radial 
average surface density profile of NGC 2403. As we can see, 
the column density distribution is quite insensitive to the av- 
eraging scale of the map. As we average over progressively 
larger scales we simply lose the relatively rare highest column 
density regions, while at lower column the distribution is in- 
sensitive to the averaging scale. This insensitivity is due to the 
steep power spectrum of HI maps, as noted in § 12.31 As a re- 
sult, the turnover of the column density distribution is almost 
identical over a wide range of scales. In addition, the column 
density distribution derived from the radial profile has a sim- 
ilar turnover. As a quantitative measure of this insensitivity 
to resolution, we fit double power-laws to the distributions in 
Figure |4] and found that the turnover column density changed 
by 0.05 dex as we changed the resolution from 93 pc to 372 
pc, and by another 0.05 dex as we changed the resolution from 
372 pc to 1 .49 kpc. This indicates that the scatter due to reso- 
lution effects in Figure [3] should be quite small. 

In Figure [5] we show the inclination averaged column den- 
sity distribution computed using the face-on column density 
distribution from Figure [4] and equation ©. We see that the 
inclination averaged column density distribution i s very simi- 
lar to the local column density distribution from lZwaan et alj 



7 



10" 21 

10 -28 
10 -23 
10 -24 
10 -25 
10 -26 
10 -27 
1Q -28 



n 1 — i inn 



h 1 — I I I I I I I 



10 -29 

1Q 20 



•••«. -1 dex » 

-2 dex •• . I 

' \ " 

-3 dex 

\ ° ■ 

£ z=0 (Zw+05) 
> NGC 2403 93pc 

NGC 2403 372pc 
o NGC 2403 1.49 kpc 
NGC 2403 radial 

I L_ 



10 21 
N HI (cm- 2 ) 



10 2 



Fig. 4. — Column density distribution of NGC 2403 corrected for inclination 
and computed using maps of HI averaged on different scales. We compute 
the column density distribution at different resolutions by first averaging the 
maps over patches of the stated scale, and then computing the column density 
distribution from the resulting maps. We note that the column density distri- 
butions are similar over a wide range of scales, as expected due to the steep 
power spectrum of HI maps. We also show column density distribution of 
this galaxy constructed from its radial surface density profile (triangles), av- 
eraged in radial bins of 150 pc using the HI map at 93 pc resolution, corrected 
for inclination, and using radii of the face-on projection. The column density 
distribution derived from the radial profile is similar to column density dis- 
tribution derived from a coarse grained map of the galaxy. For comparison 
purposes, we have offset each subsequent column density distribution by 1 
dex. In addition, we hav e included the colu mn density distribution of local 
galaxies as measured by Zwaan et al. 1 2005), listed as z = (Zw + 05). 



(2005). Note that we repeated this procedure for the other 
galaxies in THINGS and got similar results. In addition, we 
see that the inclination averaged column density distribution 
is even less sensitive to the resolution than the face-on col- 
umn density distribution. This is due to the fact that equation 
([8]) is insensitive to the steepest parts of the face-on column 
density distribution and hence insensitive to the small-scale 
structures in the HI map. As a result, even the average radial 
surface density profile of HI is sufficient to produce an inclina- 
tion averaged distribution that is similar to that obtained from 
the highest resolution HI map. Note that the high-A^i tail of 
the column density distribution, Nhi > 10 22 crrT 2 , can still be 
sensitive to the small-scale structures in the map because even 
an edge-on galaxy will have some maximum column density. 

From Figure [3] we see that the turnover column density is 
relatively similar for all galaxies in the THINGS sample and 
depends only weakly on metallicity. Through inclination ef- 
fects, this turnover column density within each galaxy gives 
rise to the turnover in the column density distribution for a 
statistical sample of galaxies. In light of this idea, we can 
now better understand the universality of the column density 
distribution as seen in Figure Q] The similarity in turnover of 
the column density distribution at low and high redshift indi- 
cates that the galaxies at low and high redshifts have similar 
maximum column densities of HI. We argue that this universal 
maximum is not due to the HI-H2 transition since this would 
imply a dependence on metallicity which is not seen. The 
exercise described in this section also indicates that this max- 
imum column density does not arise in the small-scale struc- 
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Fig. 5. — Inclination averaged column density distribution of NGC 2403 
computed at various resolutions. To compute the inclination averaged col- 
umn density distribution, we take the face-on column density distributions 
from Fig. [4] and then use equation (8). The description of how we compute 
the face-on column density distributions is given in Fig.g] Once again, we 
offset each distribution by 1 dex for comparison purposes. As we can see, 
randomly orienting just NGC 2403 gives rise to a column density distribution 
which l ooks remark ably similar to the local column density distribution from 
Zwaan et al. 12005), which is listed as z, = (Zw + 05). Furthermore, the in- 
clination averaged column density distribution is very insensitive to the scale 
at which it is computed, therefore the small-scale features are unimportant 
for determining the global column density distribution. In light of the discus- 
sion after equation J8), the turnover of the local column density distribution 
above JVhi ~ 10 22 cm- 2 may also be due to the maximum column density 
of an edge-on galaxy. Note that the dip in the radial column density distri- 
bution at small column densities is due to the radial profile not extending to 
large enough radii and hence small enough column densities. As a result, the 
face-on column density distribution derived from the radial profile does not 
extend to low column densities so the inclination averaged column density 
distribution will look artificially low for small column densities. 

ture of the HI distribution, shaped, for example by turbulent 
cascades, but is present in the large-scale structure of the HI 
disk. It is present, for example, in the average radial HI sur- 
face density profile of disks. It thus appears to be related to 
the global processes that set properties of galactic disks rather 
than the small-scale dynamics of the ISM. 

In principle, simulations of galaxy formation should also 
reproduce such universal behavior if the physical processes 
shaping the surface density structure of galaxies are modeled 
correctly. In the following section we will compare the ob- 
servational results discussed in this section to the results of 
cosmological galaxy formation simulations. 

4. SIMULATIONS 

In this section we will utilize our simulations in two ways. 
First, we will use the observations described in the previous 
section as a stringent test of our simulations. Second, we will 
use the simulations as a testing ground to investigate the ef- 
fects of various physical mechanisms on the column density 
distribution. This will teach us which of these physical mech- 
anisms are most relevant for the turnover. 

In this paper we use several sets of related simula- 
tions performed with the Adaptive Refinement Tree (ART ) 
code (lKravtsovll999HKravtsov etli]l2002l:lRudd et al.ll2008l) . 
which uses adaptive mesh refinement for both gas and dark 
matter, thus achieving a large dynamic range in spatial scale. 
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These simulations follow one or more regions of interest, typ- 
ically selected as Lagrangian regions of five virial radii around 
typical dark matter halos. Regions of interest are embedded 
into a coarsely resolved cosmological cubic volume with pe- 
riodic boundary conditions. In all simulations used here the 
maximum spatial resolution inside regions of interest is about 
260 comoving pc (65 pc at z = 3) which is the result of 9 
levels of adaptive mesh refinement. The highest dark matter 
mass resolution is 1.3 x 1O 6 M and the baryonic mass resolu- 
tion varies from ~ 10 3 M o to ~ 1O 6 M depending on cell size 
and density. 

All our simulations include three-dimensional radiative 
transfer of UV and ionizing radiation from stars formed dur- 
ing the simulation, as well as the cosmi c background. This i s 
done with the OTVET approximation (IGnedin & Abelll2001l) . 
Radiative transfer is especially important for the column den- 
sity distribution since it correctly models the local radiation 
flux which can ionize neutral hydrogen regions. Finally, these 
simulations include the non-equilib rium chemical network o f 
hydrogen and helium described in Gned in & Kravtsovl d201 11) . 
This network includes molecular hydrogen and, in particu- 
lar, the formation of H2 in both the pri mordial phase and on 
dust gr ains. We note that the H2 model in lGnedin & Kravtsovl 
d201 ll) is calibrated against observations of the Milky Way, 
LMC, and SMC. Therefore, we expect the model to be valid 
over a range of metallicities and radiation fields which extend 
to at least the values characterized by the SMC. 

Our fiducial run is a fully cosmological simulation with a 
box size of 6h l comoving Mpc and a single region of in- 
terest around a Milk y-Way type galaxy ( the simulation called 
"cosmo II" in iGnedin & K ravtsov 2011). The computational 
box outside the region of interest is covered with a uniform 
64 3 grid. The cosmological parameters used in the fiducial 
run are similar to the WMAP1 parameters: Q.m = 0.3, = 
0.046, cr 8 = 0.9, and h = 0.7. 

In addition to the fiducial run, we use a larger run with a 
25/T 1 comoving Mpc box and 256 3 root grid, focusing on 5 
Lagrangian regions around halos with masses between 10 12 
and 1O 13 M , with exactly the same physics and mass and spa- 
tial resolution as in the fiducial run. We use this simulation 
to test the robustness of our results and for better statistics of 
halo based properties. 

We also used the "fixed ISM" runs of IGnedin & Kravtsovl 
(1201 ll) . in which the dust-to-gas ratio and the overall normal- 
ization of the radiation field at 1000 A were fixed to constant 
values. These are useful when we want to compare particular 
galaxies to models with exactly the same dust-to-gas ratio and 
interstellar UV field. For example, as a model for the ISM of 
the SMC we can use a "fixed ISM" simulation with a dust-to- 
gas ratio corresponding to a metallicity of Z « O.1Z and a 
radiation field which is 10 - 100 times larger than that in the 
Milky Way. 

4.1. Computing the Simulated Column Density Distribution 

In o rder to compare our simulations to the observations in 
§ 12.21 we need to compute the column density distribution. 
The procedu re is similar to the one discussed for the THINGS 
galaxies in § 12.31 

Since the DLAs are rare in our simulation, we expect to find 
at most one DLA along a random line of sight. Therefore, we 
project our simulated box separately along each axis to get the 
projected column densities for the box. On the boundary we 
use a grid whose cell size is the same as the smallest cell in 
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Fig. 6. — Comparison of the fiducial run with observations at z = 3. The 
squares, z = 3 (No + 09), are from Noterdaeme et al. (2009). Clearly, we 
overpredict the frequency of high column density DLAs. 

our simulation. For the 6 comoving Mpc box we project 
onto a grid consisting of (64 x 2 9 ) 2 pixels which corresponds 
to a resolution of 65 pc at z = 3. Given this projected map, we 
can compute the column density distribution using equation 
To improve our statistics, we average the column density 
distribution over the three cartesian projections. We have ver- 
ified that the column density distribution has converged at this 
resolution for column densities below A^hi ~ 10 23 crrT 2 . 

In addition, when we project the box onto the boundary grid 
as described above, we also save the location of the highest HI 
density cell along each hypothetical line of sight. This is use- 
ful since we can subsequently project the HI density along a 
shorter segment which is centered on the maximum HI density 
cell. We have verified that if we integrate along a 27 comov- 
ing kpc segment about each of these maxima and compute the 
column density distribution, we have converged to the full- 
box column density distribution above A^hi = 10 19 crrT 2 . This 
is useful when we post-process the simulation since then we 
can save computational time by integrating along these rela- 
tively short lines of sight. 

Note that the column density distributions considered in this 
study were extracted from Lagrangian regions around progen- 
itors of massive galaxies and therefore they may not be fully 
representative of the mean distribution in a large random vol- 
ume of the universe. However, we focus on qualitative fea- 
tures of the column density distribution and do not attempt to 
make detailed quantitative comparisons. Given that the dis- 
crepancies we identify can be traced to the internal density 
distribution in galaxies of a wide range of masses, we believe 
our results and conclusions are representative and generic. 

4.2. Comparing the Simulated Column Density Distribution 
to Observations 

A comparison of the observed column density distribution 
to simulation results is shown in Figure [6] for both the 6 and 
25/i~' comoving Mpc boxes. Note that we have normalized 
the column density distributions to match the observed col- 
umn density distribution at Nm = 10 20 3 crrT 2 . 

From Figure [6] we see that the simulated column density 
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Fig. 7. — Comparison of column density distribut ions for various ISM con - 
ditions. The squares, z = 3 (No + 09), are from Noterdaeme et al. 1 2009). 
The dust-to-gas ratio and the radiation fields are in solar and Milky Way units 
respectively. As we see, for sufficiently high metallicity, we can reproduce 
the observed column density distribution. 

distribution turns over at a higher column density than the ob- 
served column density distribution. Since we have a large 
simulation volume which includes many galaxies, this result 
has already taken into account the effect of random orienta- 
tions which we described in § [3] Furthermore, since our sim- 
ulations have metallicities and radiation fields which are con- 
sistent with Lyman Break galaxies ove r a variety of redshifts 
(see Fig. 2 in lGnedin & Kravtsovl20 10) and our HI-H2 model 
is consistent with observations, we should be correctly repro- 
ducing the effect of the HI-H2 transition. The fact that our 
HI column density distribution is turning over at such a high 
column density indicates that our galaxies have face-on col- 
umn density distributions which extend to high column den- 
sity. This indicates that we are not correctly modeling the 
surface density distribution of the individual galaxies in the 
simulation. Below we will investigate how this can be reme- 
died and what it tells us about the physics which shapes the 
column density distribution. 

4.3. Effect of Metallicity and Radiation Field on the Column 
Density Distribution 

A large part of our argument in § [2] was based on the ef- 
fect of metallicity and UV field on the HI-H2 transition and 
the corresponding effect this would have on the column den- 
sity distribution. Since our simulations model molecular hy- 
drogen, we can test this directly in the simulations using 
the fixed-ISM runs. We considered SMC-like ISM param- 
eters: Z = O.1Z and a UV radiation field, denoted by U, 
which is 100 times stronger than the fields in the Milky Way, 
U = IOOUmw; and Milky Way-type ISM parameters: Z = Z 
and U = Umw- Finally, we consider an intermediate model 
with Z = 0.1Z o and U = Umw- The column density distribu- 
tions of these three model ISMs are presented in Figure [7j 

As we can see from Figure [7] an increase in the metallicity 
shifts the turnover to lower column density. This is expected 
due to the metallicity dependence of the HI-H2 transition. As 
we argued above, the observed turnover at Nhi ~ 10 21 cm"" 2 
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Fig. 8. — Proper cross-section versus the virial mass of the nearest primary 
halo. This data is from the 25/i"' comoving Mpc simulation to get better 
halo statistics. The triangles represent the cross-sections of halos above the 
DLA threshold of Nm = 2 X 10 20 cm" 2 . The squares represent the cross- 
sections of halos above Nm = 10 22 cm" 2 . The crosses represent systems 
above iVm = 10 23 cm" 2 . 

does not depend strongly on metallicity (Fig. [TJ and Fig. |2]i 
and thus is not due to HI-H2 transition. As a check that the 
difference in Figure |7j is indeed due to the HI-H2 transition 
and not some process which is blowing all the high density 
gas away, we can compare HI column density distribution to 
the neutral hydrogen column density distribution, which in- 
cludes both atomic and molecular gas. Such distributions are 
very similar for all of the metallicity and radiation field con- 
figurations in Figure [7] which indicates that the trend shown 
is indeed due to the metallicity dependence of the HI-H2 tran- 
sition. 

4.4. Systems Responsible for the High Column Density 
Discrepancy 

In order to understand which physical process are missing 
or incorrectly modeled in our simulations, it is helpful to know 
which systems are contributing the most to the high column 
density lines of sight. This is accomplished by associating the 
HI systems with their host halo and determining which ha- 
los give rise to the high column density systems. First, we 
consider the cross-section above a fixed column density ver- 
sus the virial mas s wh ich is plotted in Figure [8] In terms of 
the quantities in § 12.11 the cross-section for a halo of mass M 
above a HI column density of A^hi is given by 

r°° do-(M,N'z) 
o- Nm (M,z)= ™ dtT m . (9) 

From Figure[8]we can see how the cross-section of a galaxy 
above a fixed column density increases with increasing halo 
mass. Although the more massive halos have larger cross- 
sections, we must weight the cross-sections by the number 
density of their associated halos in order to know the relative 
importance of each galaxy type. 

One way to describe the importance of each mass range is 
to compute the column density distribution of systems associ- 
ated with each mass range. This is shown in Figure|9] As we 
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Fig. 9. — Contribution to the column density distribution from different 
ranges of halo masses. The lines are obtained from the 25/i~' comoving 
Mpc simulation at z = 3. The squares, z = 3 (No + 09), are observations 
from Noterdaeme et al. ( 2009). Note that the column density distributions 
here are normalized such that their sum matches the observed distribution at 
N m = 10 20 - 3 cm" 2 . 

can see, the shape of the column density distribution is fairly 
similar for all mass ranges. While the dwarf galaxies, with 
halo masses in the range 10 9 M o < M < 1O U M , give the 
largest contribution at all column densities, none of the mass 
ranges have column density distributions which are consistent 
with observations. Therefore, we see that we need to include 
physics which is efficient at removing high density gas from 
both dwarf galaxies and Milky Way-like galaxies. 

4.5. Additional Physical Mechanisms which can Affect the 
Column Density Distribution 

Now that we understand which systems are giving rise to 
the discrepancy, we can try to understand what additional 
physics we need to correctly model these systems. 

Looking back to equation (0, we recall that we must cor- 
rectly model the differential cross-section, or equivalently the 
surface density distribution, of each galaxy to get the correct 
column density distribution. We a re already modeling t he HI- 
H2 transition using the model of Gned in & Kravtsovl d201 ll) 
and observational evidence indicates that this transition does 
not play an important role for these systems. As a check 
we have also p o st-processed o ur gas with the H2 model o f 
iKrumholz et all J2008L [2009bb : IMcKee & Krumholzl d2010h . 
which leads to only a small decrease in the column density 
distribution function at large A^hi- The small difference be- 
tween our model a nd the model of Krumholz et al is not sur- 
prising, given that IKrumholz & Gnedir] (1201 ll) showed that 
these two H2 models agree well for metallicities greater than 
Z/Zo-0.1. 

4.5.1. Sub-grid Models for Turbulence and Self-gravitation 

While the observed column density distribution at high red- 
shift is probing the gas at scales significantly smaller than 1 
pc, the highest resolution of our simulations is only 65 pc at 
Z = 3. Therefore it is possible that the discrepancy between 
observed and simulated column density distributions is due 



to different column density structure on small scales. Indeed, 
quasar lines of sight probe the column density distribution on 
sub-parsec scales, while the smallest resolved scales in simu- 
lations are ~ 100 pc. It is not guaranteed a priori that column 
density distributions measured at such different scales should 
match. As an extreme example, consider a toy model in which 
the HI gas on small scales is in tiny ~ 1 pc size clouds. For a 
sparse scattering of clouds measured at a resolution of 1 pc or 
smaller, most lines of sight would pierce no clouds and a few 
would pierce the cloud giving rise to two peaks in the column 
density distribution. In contrast, at a significantly coarser res- 
olution, say 100 pc, the column density distribution would 
depend on the spatial distribution of these clouds. If they 
are uniformly spread out then the column density distribution 
would have a single peak at the average column density with a 
width given by the central limit theorem. However, in general, 
different regions could have different column densities when 
averaged over 100 pc, giving rise to a very broad column den- 
sity distribution. In principle, the column density distribution 
measured at different scales can look very different. 

We can attempt to model the effect of a physically plausible 
small-scale density distribution using the results of turbulence 
simulations on GMC scales. Turbulence leads to compres- 
sion and rarefaction of the gas, which can affect the column 
density of the gas and generically predicts a log-normal dis- 
tribution for densities. While we do not know the exact na- 
ture of DLA systems at high redshift, observations of the lo- 
cal universe indicate that the highest column density systems 
are in structures like giant molecular clouds (GMCs). The gas 
density PDF in such clouds has been studied and is known 
to be log-normal (see IMcKee & Ostrikerfl20"07t iKritsuk et al] 
2007, for a review). Interestingly, the colum n density PDF 
also obeys a log-norm al distributio n in GMCs dOstriker et alj 
1200 U iGoodman etail 120091) . In lOstriker et al.1 (120011) the 
authors simulated GMCs using magnetohydrodynamics and 
found that the column densities obeyed a log-normal PDF 
which indicated that the gas density was correlated along lines 
of sight due to coherent structures on larger scales. This was 
followed up by observational results in Goodman et alj ( 120091) 
where the authors observed the column density distribution 
of nearby GMCs. The observed column densi ties satisfied a 
log-no rmal PDF with similar variance to that of Ostriker et al. 
(2003). 

These studies found that the area-weighted column density 
PDF of a cloud with a mean column density of N is given by 



P(y) 



VI 



1 /l 7\ 



(10) 



wher e y - ]n(N/N) and n - o 2 12, as required for norm aliza- 
tion. lOstriker et all d2001l) and (Goodman et all (120091) found 
that the log-normal distribution is a good fit to the simulated 
and observed distributions with <x e (0.25, 0.55) depending on 
the conditions in the gas, ie mach number and magnetic field. 
In comparison to these GMCs, the DLAs consist of mostly 
atomic hydrogen, which can be expected to have a smoother 
spatial distribution. Therefore, the column density PDFs de- 
scribed above will serve as an overestimate of the variance in 
DLAs. In light of this, we have tested a large range of pa- 
rameters, cr e (0.1, 5), to see if there is any large effect on the 
column density distribution. 

We implement the log-normal PDF on a cell-by-cell basis. 
In each cell, along each line of sight, we take the HI column 
density and multiply it by a realization of the log-normal PDF. 
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Fig. 10. — Effect of the log-normal PDF of ISM turbulence on the column 
density distribution. The squares, z = 3 (No + 09), are observations from 
Noterdaeme et al. (2009). The highest observed values are cr ~ 0.6. We 
include the extreme case of cr = 4 to show that even an extreme log-normal 
PDF will not remedy the discrepancy. Note that we have normalized the 
cr = 0.6 and cr = 4 runs by the same amount for the sake of comparison. 

This procedure rarefies and compresses adjacent cells inde- 
pendently which is justified since our minimum cell size is 65 
pc at z — 3, larger than a typical GMC. Since this procedure 
changes the column density and hence the shielding proper- 
ties of the gas, we also re-process the res ulting compressed or 
rarefie d gas with fits to the H2 models in iGnedin & Kravtsovl 
(1201 ll) . The processed column density distributions are pre- 
sented in Figure[l0] In addition, we experimented with apply- 
ing the realization only if the individual cell has a sufficiently 
large A^hi, but the difference is negligible above this column 
density threshold. 

As we can see from Figure [lOl there is very little effect for 
reasonable values of the variance. However, for larger values 
we see that the normalization of the column density distribu- 
tion is decreased. This follows since the log-normal PDF is 
not symmetric and rarefies more systems than it compresses. 
We have also checked that this post-processing model of tur- 
bulence can be implemented by taking the convolution in log- 
space of the HI column density distribution with a log-normal 
PDF. 

Another important subgrid mechanism that can affect these 
high column density systems is self-gravitation. Observations 
of molecular clouds and simulations have indicated that self- 
gravitation gives rise to a power -law tail in the colu mn density 
PDF at high column densities dKritsuk et al.ll201 lh . We can 
include this effect by implementing the same procedure as for 
the log-normal PDF but also including a power-law tail. 

Since we do not know the exact power-law behavior that 
will be present in high redshift DLAs we once again test a va- 
riety of parameters. For re alistic power-law tails similar to the 
ones in lKritsuk et alJ (1201 ll) . we find no significant change in 
the column density distribution. This is expected from cross- 
sectional arguments since the regions in which the power-law 
behavior develops, GMCs, will be very small and hence very 
rare in a cross-section weighted sample. 

As a final comment, we note that the results of the log- 



normal PDF model described above are consistent with a n in- 
dependent, observation-based argument mentioned in § 12.31 
There we noted that observations of nearby galaxies show 
that the HI column density power spectra can be described 
by a steep power-law over a wide rang e of scales, rang- 
ing f r om ~ 0.1 pc up to ~ 10 kpc (St animirovi c et alJ 
19991: Elmegree n et alj|200lt iMiville-Deschenes et all 120031; 
Dutta et al]|2009l etc.), with slopes of the power spectra rang- 
ing from ~ -1.5 to 3.5. Such steep power-law behavior 

indicates that the HI maps do not have large-amplitude fluc- 
tuations on small scales and hence the column density distri- 
bution cannot be a strong function of averaging scale. This 
argument hinges on observations of local galaxies. However, 
if the origin of steep power-law spectra is due to some uni- 
versal process, such as turbulence, it is reasonable to expect 
that ISM giving rise to DLAs will have similarly steep power 
spectra. 

4.5.2. Toy Model for Feedback Effects 

Since DLAs are associated with the ISM of galaxies, they 
can be greatly affected by the physical processes which occur 
in the central regions of these galaxies where the star forma- 
tion rate is highest and effects of feedback can be expected to 
be strongest. Due to high densities and star formation rates, 
the centers of these galaxies will be affected by feedback ef- 
fects like supernovae and radiation pressure. So me of these 
effects have been included i n recent works (see lAltav et all 
l201lt iFumagalli et alJl20Tlh . While we have included the 
effects of supernovae, this feedback scheme is inefficient in 
driving outflows and does not change the distribution of gas 
appreciably. 

To explore the impact of a much stronger feedback, we con- 
sider a simple toy model. Since feedback mechanisms are 
most relevant for the high densities in the inner parts of galax- 
ies, we can set an upper limit of their effect by removing the 
inner parts of these disks when we calculate the column den- 
sity distribution. If this overestimate of the feedback effect 
is compatible with the observed column density distribution, 
then it is plausible that feedback can bring our simulation into 
agreement with observations. 

Figure QT| shows the column density distribution with re- 
gions of radius 1 and 3 kpc from the parent halo center re- 
moved. We see that removing a region of radius 3 kpc is 
sufficient to bring the simulations into agreement with obser- 
vations. It remains to be seen if realistic star formation and 
feedback models can account for such a substantial removal 
of gas from the central regions of high-redshift galaxies. 

5. COMPARISON TO RECENT SIMULATIONS 

A number of recent studies have presented column den- 
sity distributions fro m simulations and compared them 
to observations ( e.g. iRazoumov et al.l l2006t iPontzen et al] 
|2008t ICenl l20ll iNagamine et all 120 lfA lAltav et alJ 1201 Ik 
Fumagalli et al.ll201 lb . Some of the simulations in these stud- 
ies have successfully reproduced the column density distribu- 
tion while others were discrepant. Since the observed column 
density distribution is a sub-parsec scale probe of atomic hy- 
drogen at high redshift, it provides a stringent test of simu- 
lations. In fact, since the resolutions found in cosmological 
simulations are at best hundreds of parsecs, it is surprising 
that any simulation can match observations without delving 
into the sub-grid physics. In this section we will try and un- 
derstand the reasons behind the success and failures of these 
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Fig. 1 1 . — Effect on column density distribution of removing different sized 
regions around the center s of halos . The squares, z = 3 (No + 09), are 
observations from Noterdaeme et al. (2009). We see that removing the gas in 
the inner 3 kpc of halos brings the column density distribution into agreement 
with observations. 

simulations. Such insight will help us understand what addi- 
tional physical processes are needed in future simulations to 
o btain the observed colum n density distribution. 

iRazoumov et al.l (|2006) used an adaptive mesh refinement 
simulations with a set of physical processes similar to our sim- 
ulations. Their simulations include detailed gas chemistry for 
all species of hydrogen and helium, radiative transfer above 
the lyman limit, UV background, but no star formation and 
no H2 model for HI-H2 transition. They simulated a variety 
of box sizes ranging from 2 - 8/i~' comoving Mpc with res- 
olution at z — 3 varying from 93 pc for the smallest box, to 
1.5 kpc for the largest box. In addition, they ran their 8/2 1 
comoving Mpc box with a variety o f resolutions from 190 pc 
to 1.5 kpc. Razou mov et al.l (120061) found a power-law col- 
umn density distribution in their simulations that extends to 
A^hi ~ lO 22 5 cirT 2 without any sign of a turnover. We find a 
similar behavior which extends to even higher column densi- 
ti es du e to the higher resolution of our simulation. 

ICenl (120121) used an adaptive mesh refinement code with 
a zoom-in simulation on a 120/z -1 comoving Mpc box onto 
a cluster and a void of sizes approximately 20/i _I comoving 
Mpc and 30/i~' comoving Mpc respectively. The grid reso- 
lution is 460/j~' pc. The simulations include a Haardt/Madau 
UV background, neutral hydrogen self-shielding, metallicity 
dependent radiative cooling, star formation, supernova feed- 
back, and a molecular hydrogen model based on a solar metal- 
licity HI-H2 transition. They find a column density distribu- 
tion at z — 3 which is similar to observations. The key differ- 
ence of our simulations is that we include a model for HI-H2 
transition, which takes into account the dependence on metal- 
licity. The model used by ICenl (120121) . on the other hand, is 
based on observations of the HI-H2 transition in the Milky 
Way. As a check of the effect of their H2 model, we have 
rep roce s sed ou r neutral hydrogen with the H2 model used 
by ICenl (12012b and found that we also get a similar agree- 
ment with observations. However, this agreement is artifi- 
cial because a solar metallicity model for the HI-H2 transition 



is not applicable to z ~ 3 DLAs, which have metallicities 
Z < O.1Z . Therefore, we can conc lude that th e HI column 
density distribution in simulations of Cen (2012) actually dis- 
agrees with observations. 



lAltav et all (120111) used the set of the OWLS simulations, 
run with Gadget SPH code with subgrid models for star for- 
mation, chemodynamics, galactic winds, cooling in the pres- 
ence of a uniform UV background, and a Milky Way-type H2 
model. They consider column density distribution in a box 
of size 25/i~' comoving Mpc with a gravitational softening 
length of 2.79 comoving kpc. With this set of physical pro- 
cesses, lAltav et all (1201 ll) were able to approximately repro- 
duce the column density distribution at z — 3 across many 
decades in column density, Nya ~ 10 12 5 - 10 22 crrT 2 . No- 
tably, they get the correct turnover in the column density dis- 
tribution at jVhi ~ 10 2L5 cuT 2 . One potential caveat to this 
result is the relatively low resolution of the OWLS simula- 
tions. However, we have re-run our simulations at a simi- 
lar resolution, a minimum cell size of 520 pc at z = 3, and 
found that such a decrease in resolution does not lead to a 
better agreement with the observed column density distribu- 
tion. In addition, the OWLS simulations use a star formation 
prescription which relies on a rigid equation of state which 
can, in principle, prevent the f ormation of high de nsity regions 
within the ISM. Interestingly. lAltav et ail (1201 11) find that the 
HI-H2 transition affects the HI column density distribution 
only at Nm <; 10 2L7 cm' 2 , which is consistent with our find- 
ings. However, since their H2 model is b ased on an empirical 
calibra tion of the molecular fraction by Blitz & Roso lowskvl 
(2006), which used local galaxies, the resulting calibration 
may thus not be applicable to low-metallicit y (Z/Z < 0.1), 
high-redshift DLA systems Fumagalli et al. (201 ( J). Indeed, 
althou gh the galaxy sample studied by Blitz & Rosolowsky 
(2006) included two galaxies with metallicities of ~ 0.2Z o , 
the majority of galaxies in their sample have metallicities 
close to solar. As we noted before, strong dependence of 
the HI-H2 transition on metallicity is both expected theoret- 
ically and is actually observed in measurements of molec- 
ular fraction alo ng stellar lines of sight in the Milky Way , 
LMC, and SMC dTumlinson et alf 2002: Gil lmon et afll2006l: 
Wolfi reeTaT.1120081). 

In|Fumagalli et al.l d201 ll) the authors utilize an old version 
of the ART code, which differs from the version we use in its 
implementation of star formation, cooling, and, stellar feed- 
back. The resolution of their simulations is » 35 - 70 pc de- 
pending on redshift. Fumaga lli et alj yOll) show that their 
simulations successfully reproduce the turnover in the col- 
umn density distribution function. One potential reason for 
this match is their aggressive star formation prescription and 
supernova feedback. The star formation efficiency in their 
simulations is 20-50 times higher than the efficiency we use 
and the efficiency that is estimated for real molecular clouds. 
Likewise, their feedback includes processes that overestimate 
the actual amount of energy that should be injected into the 
ISM. Their aggressive star formation thus likely disrupts re- 
gions of high HI column density gas and may be responsi- 
ble for their agreement with the observed column density dis- 
tribution function. Although we consider the star formation 
and feedback model employed in these simulations extreme, 
the results do indicate that the discrepancy in column den- 
sity distribution between observations and our simulations is 
likely related to the inefficient feedback of our simulations. 
In their appendix, these authors show that inclusion of the H2 



13 



model of lKrumholz et al.l(l2008ll2009bh : lMcKee & Krumholzl 
(2010) is only important above Nm ~ 10 22 cm -2 , once again 
in agreement with our argument that the HI-H2 transition is 
not relevant for the turnover in the column density distribu- 
tion. 

As we can see from this section, several different mech- 
anisms can lead to the correct column density distribution. 
Even if we doubt the physical applicability of these mecha- 
nisms, we can still learn a lot from them. Clearly, star forma- 
tion and feedback have a large effect on the column density 
distribution, while the HI-H2 transition is unimportant for the 
turnover at Nm ~ 10 21 crrT 2 . Therefore, future efforts to cor- 
rectly model the column density distribution will require cor- 
rectly modeling star formation and feedback and, more gen- 
erally, the density structure of turbulent galactic disks. We 
plan to explore physical processes that affect the density struc- 
ture of the ISM using high-resolution simulation models of 
the ISM in a future study. 

6. CONCLUSIONS AND FUTURE DIRECTIONS 

In this study, we have argued that there are two features 
in the column density distribution at high HI column density 
systems. First, there is a turnover at Nm ~ 10 21 cm~ 2 which is 
present in both z — and high-redshift HI column density dis- 
tributions. Second, at low redshifts there is also a lack of high 
column density systems above Nm ~ 10 22 cm~ 2 . This second 
turnover can be plausibly explained by the HI-H2 transition. 
At this point, however, it is not clear whether a similar high- 
column density turnover exists in the high-z distribution as 
probing this regime would require orders of magnitude larger 
absorber samples. In this study we have focused on the uni- 
versal turnover at A^hi ~ 10 21 cirT 2 . 

In § I2.2l we argued that the HI-H2 transition does not con- 
tribute significantly to the turnover. This was supported by 
a comparison of observational evidence at both high and low 
redshift which showed an identical turnover. This is in con- 
trast to what is expected from a HI-H2 induced turnover which 
would be strongly metallicity dependent. Furthermore, we 
showed that even in the local universe, the column density 
distributions within nearby galaxies exhibit a universal shape 
similar to that of the overall column density distribution and 
show little dependence on metallicity. Altogether, this obser- 
vational evidence led us to conclude that the turnover is due to 
some universal properties of HI distribution in galaxies, pre- 
sumably shaped by global processes that set overall structure 
of gaseous disks during their formation. 

In the following section, §[3] we revisited the idea of deriv- 
ing the column density distribution from r andomly o riente d 
disks which was originally consider ed in iMilgroml dl988l) . 
iFall & Peil (fl993l IWolfeet alJ ( [19951) . We extended this idea 
to randomly oriented two dimensional structures in equation 
(0. This extension allows us to take the face-on column den- 
sity distribution of a single galaxy and then average it over 
inclination angles to produce a statistical column density dis- 
tribution. As we saw from Figure [5] this inclination averaged 
column density distribution is remarkably simil ar to the lo- 
cal co lumn density distribution as measured in Zwa an etaTI 
(2005). Furthermore, we showed that this inclination aver- 
aged column density distribution is insensitive to the averag- 
ing scale of the HI map, indicating that the local column den- 
sity distribution itself is fairly insensitive to small-scale fea- 
tures. This implies that the turnover at Nm ~ 10 21 cirT 2 is not 
due to the small-scale features in a galaxy, but rather due to 
the large-scale structure of the HI distribution or, equivalently, 



the radial profile of the galaxy. 

Next, we turned to comparisons of the HI column den- 
sity distribution in our simulations and in observations, which 
showed that our simulations severely overpredict the fre- 
quency of high column density systems, especially above 
A^hi ~ 10 22 cm~ 2 . Since the column density distribution de- 
pends on a variety of physical processes, our failure to match 
the observed distribution teaches us about the shortcomings 
of our simulations. We used the simulations to explicitly 
show how a HI-H2 induced turnover depends on metallic- 
ity. We also showed that dwarf galaxies at z ~ 3 provide a 
dominant contribution to the column density distribution, but 
Milky Way-sized galaxies also contributed some high column 
density sight lines. We showed that the discrepancy is not 
alleviated by increasing resolution. 

We then checked the effect of two important pieces of sub- 
grid physics on the column density distribution: small-scale 
ISM turbulence and gravitational collapse within GMCs. 
Both of these were included through their known effect on 
the column density PDF. We found that for physically real- 
istic parameters, they have a negligible effect. In addition, 
we showed that simulations can be brought in agreement with 
observations if a region of radius 3 kpc around centers of all 
galaxies is removed. This exercise shows that the discrepancy 
is due to excessively high densities in the centers of our sim- 
ulations. Presumably this excess gas should be removed by 
stellar feedback. However, it remains to be seen if realistic 
models of stellar feedback can remove such a large amount of 
gas. 

Finally, we compared our results to the results of several 
recent papers which presented simulation predictions for the 
HI column density distribution. Although some of the simula- 
tions in these studies were successful in reproducing the high 
column density part of the distribution, we believe that this 
success may partially be due to the extreme feedback models 
employed. Despite this, their success indicates the sensitivity 
of the column density distribution to these mechanisms and 
indicates that the high-A^ei tail of the DLA column density 
distribution is a sensitive probe of both gas dynamics in high- 
redshift disks and effects of stellar feedback. Future work can 
therefore fruitfully focus on detailed modeling of these as- 
pects of forming high-z galaxies. 

Sho rtly after the submission of this manuscript, iBraunl 
(120121) considered the effect of 21cm HI self-absorption on 
the column density distribution. He claims that the column 
density distribution is sensitive to b oth resolution, in contra- 
diction with our argument in § |2Jj a nd Fig ure ID and opacity 
corrections. In Figure 6 of IBraunl (120 121) he compares the 
HI column density distribut ion of a large sample of nearby 
galaxies (Zwaan et al. 2005), which made use of ~ 1.5 kpc 
to ~ 3 kpc-scale maps, to the column density distribution of 
a weighted combination of three local galaxies (M31, M33, 
and LMC) corrected for self-absorption, which made use of 
~ 100 pc-scale maps. While these column density distribu- 
tions have a significant difference below Nm = 10 21 cirT 2 , 
opacity corrections are not important at such low column den- 
sities. This indicates that the difference is due to the fact that 
M31, M33, and LMC do not form a representative sample. 
Above Nm = 10 21 cirT 2 the differences are small but since 
the sample used to make the opacity corrected column density 
distribution is not representative, it is not clear how significant 
this difference is. 
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